Copy That! Editing Sequences by Copying Spans

نویسندگان

چکیده

Neural sequence-to-sequence models are finding increasing use in editing of documents, for example correcting a text document or repairing source code. In this paper, we argue that common seq2seq (with facility to copy single tokens) not natural fit such tasks, as they have explicitly each unchanged token. We present an extension capable copying entire spans the input output one step, greatly reducing number decisions required during inference. This means there now many ways generating same output, which handle by deriving new objective training and variation beam search inference handles problem. our experiments on range tasks language code, show model consistently outperforms simpler baselines.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the Quadratic Spans of Periodic Sequences

Random binary sequences are required in many applications of modern communication systems and in designing reliable circuits. However, truly random sequences are often associated with extremely high costs, and are therefore infeasible to use. Deterministically generated sequences that pass certain statistical tests suggested by random sequences are often used instead and are referred to as pseu...

متن کامل

Network growth by copying.

We introduce a growing network model in which a new node attaches to a randomly selected node, as well as to all ancestors of the target node. This mechanism produces a sparse, ultrasmall network where the average node degree grows logarithmically with network size while the network diameter equals 2. We determine basic geometrical network properties, such as the size dependence of the number o...

متن کامل

Spans of preference functions for de Bruijn sequences

A nonbinary Ford sequence is a de Bruijn sequence generated by simple rules that determine the priorities of what symbols are to be tried first, given an initial word of size n which is the order of the sequence being generated. This set of rules is generalized by the concept of a preference function of span n − 1, which gives the priorities of what symbols to appear after a substring of size n...

متن کامل

Enzyme-free genetic copying of DNA and RNA sequences

The copying of short DNA or RNA sequences in the absence of enzymes is a fascinating reaction that has been studied in the context of prebiotic chemistry. It involves the incorporation of nucleotides at the terminus of a primer and is directed by base pairing. The reaction occurs in aqueous medium and leads to phosphodiester formation after attack of a nucleophilic group of the primer. Two aspe...

متن کامل

D-form Sequences: Families of Sequences with Low Correlation Values and Large Linear Spans

Large families of binary sequences with low correlation values and large linear span are critical for spread spectrum communication systems. In this paper we describe a method for constructing such families from families of homogeneous functions over nite elds, satisfying certain properties. We then use this general method to construct speciic families of sequences with optimal correlations and...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i15.17606